ReefCloud
Coral cover changes at large spatial scales
1 Introduction
This document complements information displays on the ReefCloud Dashboard (https://reefcloud.ai/). It contains a description about the purposes of the spatio-temporal model adopted by ReefCloud to predict coral cover changes across spatial scales and how to interpret outputs.
We present the different aspects of the modelling framework using a working example with reproducible R codes available at https://github.com/ReefCloud.
2 Methods
2.1 Coral data
The working example uses observations of coral cover from the ongoing Long Term Monitoring Program (LTMP) (Emslie et al. 2020). In the region of the working example, the LTMP surveys seven coral reefs with three sites of five permanently marked transects at each reef. Observations from 2004 to 2022 at the transect scale are used in the model (Figure 1). These observations are derived from the outputs of the machine learning algorithms from the ReefCloud platform.
2.2 Spatial scales of aggregation
To predict coral cover changes across large spatial scales, ReefCloud uses four different spatial scales available globally:
- tier2 - country level
- tier3 - levels of management regions
- tier4 - levels of marine ecoregions of the world (Spalding et al. 2007)
- tier5 - 5x5 km hexagonal grid
In the ReefCloud framework, coral cover observations can be extracted using levels from tier2, tier3 or tier4. In our working example, coral cover data are extracted within the limits of each tier4 level (Figure 2) due to the size of the country. Indeed, the area of the unit 1808 corresponds roughly to the size of Spain (14 millions hectares). This tier4 unit is composed of 1110 tier5 cells and 33 of them contain long-term observations in coral cover (Figure 3). Additional information within each tier5 cell includes the area occupied by coral reefs and unique reef identifier that were derived from the Reefs at Risk Revisited Project (Burke et al. (2011)).
2.3 Disturbance data
Cyclone exposure
The spatial distribution of likely cyclone impacts was estimated using the 4MW model (Puotinen et al. 2016), which predicts the duration of damaging waves from cyclones by year. It is based on the duration and speed of modelled cyclonic winds, and estimates on the tier5 grid. The model defines damaging waves as the average of the highest one-third of wave heights over a sustained period of high winds, and which are four meters in amplitude or greater.
In the region of the working example, we detected increasing wave intensity in the years 2017 and 2018 associated with the cyclones Debbie that hit the region in 2017 at category 4 level and category 2 Iris in 2018 (Figure 4).
Heat stress
Degree Heating Weeks (DHW) was used to estimate heat stress impacts potentially leading to mass coral bleaching. DHW is derived from the Coral Bleaching HotSpot product that provides an instantaneous estimate of heat stress at a spatial resolution of 5 km over a 12-week window (Liu et al. 2018). The maximum DHW per year was used as indicator of heat stress event in the model. The year associated with DHW is modified when the maximum DHW occurred after field observations of the same year. In this case, associated DHW value is used for the next year.
In the region of the working example, heat stress events are detected for the years 2016, 2017 and 2020 (Figure 5).
2.4 QA/QC
Prior analyses found a pervasive model behavior in the presence of extreme disturbance events in locations without data. To ensure quality and accuracy in model predictions, values of cyclone exposure and heat stress were not accounted (values replaced by NAs) when two following conditions applied:
- values greater than the 97.5% quantiles of the tier4 distribution
- absence of data in the corresponding tier5
For the region of the working example, values greater than 28 hours of exposure to cyclonic waves and 8 DHWs were considered out of the distribution. In 2017, values of cyclone exposure were not accounted for 50.3% of cells (n = 559) and 6.7% (n = 75) in 2018 (Figure 6). Some values of heat stress were not accounted for the years 2016, 2017 and 2020, with a maximum of 35.5% of cells (n = 394) in 2017 (Figure 7).
2.5 Spatio-temporal model
Introduction to the model
A spatio temporal model is developed to fit the following purposes:
- using fine-scale observations of coral cover from monitoring surveys
- predict changes and associated uncertainty across spatial scales
- estimates of drivers of coral cover change
We addressed these criteria by modelling coral dynamics in space and time using a fixed-rank approach suitable for big data settings and non-Gaussian responses (Zammit-Mangion and Cressie 2021). The model uses spatio-temporal random effects that are based on an automatic specification of spatio-temporal basis functions constructed via a tensor product of spatial and temporal basis functions. These effects are spatially correlated to introduce a dependence (defined as spatial auto-correlation) between nearby tier5 cells and interpolate values of coral cover at unobserved cells. In addition, coral dynamics is estimated while considering for the effects of heat stress events and cyclones at lag 0, 1 and 2 and reef level variations. Trajectories of coral cover are scaled-up by weighting posterior distributions by reef area within each tier5 cell and summing them by year. The posterior median coral cover and associated uncertainty are estimated from the 95% highest posterior density intervals of their respective distributions (Kay 2023) based on 400 draws. In the ReefCloud pipeline, this process is repeated across all tier5 cells within tier4, tier3 and tier2 to produce coral cover trajectory across large spatial scales. In this working example, the spatio-temporal model is implemented using coral cover observations from a unique tier4 unit.
The spatio-temporal hierarchical generalized linear mixed model consists of two conditional-probability layers (Sainsbury-Dale, Zammit-Mangion, and Cressie 2021). In the process layer, coral dynamics is modelled according to different types of spatio-temporal variability attributed to disturbances, random variations that occurred at the medium spatial scales (\(\mathrm{v}\)) and other independent variations at fine scale (\(\mathrm{\zeta}\)) and reef scale (\(\mathrm{V}\)). In the data layer, the nature of the data is used to determine the probability distribution of the model. Here, we use a binomial distribution with coral cover expressed as counts per tier5 cell. Indeed, ReefCloud methodology uses a sample of 50 points within each image, so that abundances of corals \(y_{it}\) for observation \(i\) sampled at location \(\boldsymbol{s}\) and time \(t\) are integers between 0 and 50. The binomial distribution is a function of a mean latent process, \(\mu_{it}\) that are then averaged across tier5 cells by year and linked to the process layer via an inverse-logit transformation. This step facilitates the spatial change-of-support between observations in coral cover at the transect level and model predictions at the tier5 level.
Model equations
\[ \begin{align} {y_{it}} &\overset{\rm ind}{\sim} {\sf{Bin}}\bigg({\rm logit}^{-1}(\rm{Y}(\boldsymbol{s}))\bigg) , \quad \boldsymbol{s} \in \rm{Tier4}\\ \rm{Y}(\boldsymbol{s}) &= t(\boldsymbol{s})^\intercal\alpha + v(\boldsymbol{s}) +\zeta(\boldsymbol{s}) + \rm{V}(\boldsymbol{s}),\\ v(s) &= \phi(\boldsymbol{s})^\intercal\eta,\\ \eta &\sim \rm{Gau}(0,\rm{Q}^{-1}),\\ \zeta, \rm{V} &\sim \rm{Gau}(0, \sigma_{.}^{2})\\ \end{align} \tag{1}\]
where \(\rm{Y}(\boldsymbol{s})\) corresponds to the averaged mean latent process with \(\boldsymbol{s}\) the location of tier5 units within a tier4 area. \(t(\boldsymbol{s})^\intercal\) and \(\phi(\boldsymbol{s})^\intercal\) are estimated using known design matrices constructed from spatially referenced covariates and spatio-temporal basis functions, respectively. \(\alpha\) represents fixed effects of each covariate - effects of of stress events and cyclones at lag 0, 1 and 2. The medium spatial scale variation, \(\rm{v}\), is also conditioned on a vector of random coefficients \(\eta\) modelled as a multivariate Gaussian random vector with an inversed covariance kernel, \(\mathrm{Q}^{-1}\). \(\zeta\) and \(\rm{V}\) are independent random effects at the fine-scale and reef level, respectively.
Model implementation
The spatio-temporal model is fitted under an hybrid Bayesian and Frequentist framework using the R package FRK version 2.1.5 (Sainsbury-Dale, Zammit-Mangion, and Cressie 2021). A Monte Carlo approach is used to infer on the averaged mean latent process (\(\rm{Y}(\boldsymbol{s})\)) and obtain a posterior predictive distribution of coral cover for each tier5. Then model parameters and and fixed effects are treated as non-random quantities where only a point estimate is kept by the model, in line with FRK v1 (Zammit-Mangion and Cressie 2021). This framework is key for ReefCloud because it allows to substantially decrease the computational speed of the model.
The best model formulation was selected using visual and statistical diagnostics including model fit, residual patterns using the DHARMaR package (Hartig 2023), basis dimensions and Akaike Information Criterion (AIC) values. In addition, because model predictions are on a different spatial scale than the observations, we developed a leave-out data approach to proceed further evaluations of model performance. We used five prediction-performance measures to assess model predictions with the exclusion of different data (random data and by blocks). Results from these analyses are shown in Section 5.
3 Results
3.1 Tier5
At the tier5 level, predictive values in coral cover and associated uncertainty changed in space and time (Figure 8; Figure 9). It is expected when using spatio-temporal model that the uncertainty remains low at predictive locations with and close to the data and increase further away their locations. In this working example, we observed this with a large uncertainty in the southern section of the predictive area ranging between 60-100% and varying across years (Figure 9). Despite, the model captures well coral cover trajectories in tier5 with data (Figure 10) and surrounding cells (Figure 12). Predictions in coral cover are also available outside the temporal range of observed values as shown for the tier5 “7116” where observations in coral cover are only available for the years 2007-2013 whereas the model predicted changes from 2004 to 2022 (Figure 10).
Spatio-temporal predictions
Uncertainty associated with spatio-temporal predictions
Coral cover trajectories of tier5 with data
Coral cover trajectories of tier5 without data
3.2 Tier4
At the tier4 level, coral cover is estimated in square kilometer because the predictions were weighted by the area of coral reef within each tier5 cell. In this working example, predicted coral cover trajectory at the tier4 level remained relatively constant before decreasing in 2012. An increased in coral cover was predicted from 2012-2017 reaching almost 500 sq km before decreasing again in 2019 (Figure 13). We found that cyclone impact at lag1 drove coral cover loss in this region (Figure 14) - 95% credible intervals are negative. Predicted decreased in 2012 could be associated with the impact of category 5 cyclone Yasi that hit the region in February 2011. The model estimates an absolute loss in coral cover of 105 (63-143, 95% CI) sq km for the entire region. Similarly, the decrease between 2018-2019 could be attributed to cyclone Debbie that hit the region at category 4 in March 2017. We estimated that 120 (71-171, 95% CI) sq km of reef area was loss by this disturbance.
Predicted trajectory
Effect of disturbances
4 Acknowledgements
The authors would like to thank the research team that contribute to the model and pipeline development. We are grateful to collaborate with the fathers of the FRK R package Andrew Zammit Mangion and Matthew Sainsbury-Dale. We thank Marji Puotinen and Ronen Galaiduk for their work with the disturbance spatial layers, Luz Valerie Pascal to improve the ReefCloud modelling pipeline and Brendan Ford to create the report template. This research was funded by the Australian Institute of Marine Science and the Australian Department of Foreign Affairs and Trade.